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Abstract. For a first-order formula (f(x;y) we introduce and study the characteristic se- 
quence (P n : n < lu) of hypergraphs defined by P„(yi, . . . , y n ) := (3x) /\,-< n ip(x; j/j). We 
show that combinatorial and classification theoretic properties of the characteristic sequence 
reflect classification theoretic properties of <p and vice versa. Specifically, we show that some 
tree properties are detected by the presence of certain combinatorial configurations in the 
characteristic sequence while other properties such as instability and the independence prop- 
erty manifest themselves in the persistence of complicated configurations under localization. 



1. Introduction 

This article defines and develops the theory of characteristic sequences. The characteristic 
sequence (P n : n < uj) associated to a first-order formula <p is a countable sequence of hy- 
pergraphs defined on the parameter space of ip; this association allows for a new description 
of the combinatorial complexity of y-types in terms of graph-theoretic complexity of the 
hypergraphs P n . The construction arose from work of the author on saturation of ultrapow- 
ers, as described briefly below, but is of independent interest. There is a model-theoretic 
sensibility throughout, but many of the arguments are combinatorial. In fact, if the reader 
is familiar with basic model theory and is willing to take on faith the interest of certain 
classification-theoretic dividing lines, the article is largely self-contained. 

At first glance, the characteristic sequence gives a transparent language for the kinds 
of arguments which occur in many contexts where the fine structure of dividing or thorn- 
dividing is being analyzed. However, there is a power in our general framework which accrues 
from the fact that the formulas P n are simultaneously: 

(1) graphs, so we can ask about their complexity in the sense of graph theory; 

(2) formulas definable in the background theory T, so we can ask about their complexity 
in the sense of classification theory; 

(3) descriptors of the parameter space of the given formula ip. 

Leveraging these three contexts against each other puts strong restrictions on the behavior 
of the P n . We obtain, for instance, a description of NIP theories as theories in which any 
initial segment of the characteristic sequence is, after localization, essentially trivial: see 
§6 below. In some sense, then, the characteristic sequence is a tool for analyzing the fine 
structure of the independence property and gives a natural description of its complexity. 

By way of describing the kinds of complexity we consider, let us briefly mention two 
background motivations for this work. The first is a deep question of Keisler about the 
structure of a preorder on countable theories which compares the difficulty of producing 
saturated regular ultrapowers [3]. Shelah described the structure of this so-called Keisler 
order on NIP theories in a series of surprising results, collected in [8] chapter VI, but its 
structure on theories with the independence property remains open. In [5J, [7] we showed 
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that the structure of Keisler's order on unstable theories (thus, on theories with the inde- 
pendence property) depends on a classification of y?-types, and specifically on an analysis of 
characteristic sequences. 

The second motivation is the development of a new language for interactions between 
model theory and graph theory. An interest in the complexity of <y?-types asks how the many 
finite fragments of configurations cluster in the characteristic sequence and how uniformly or 
regularly they are distributed. These are issues which graph theory is particularly articulate 
at describing. Insofar as properties like edge density and edge distribution and structural 
properties of the hypergraphs P n can be shown to have model-theoretic content, this opens 
up the possibility of using a deep collection of structure theorems for graphs to give model- 
theoretic information [7]. 

The organization of the article is as follows. Section 2 contains definitions and basic 
properties. Section 3 gives several motivating examples. Section 4 gives a series of "static" 
arguments relating configurations in the characteristic sequence to classification-theoretic 
dividing lines. Section 4 begins the work of separating out inessential complexity from the 
base set of a type under analysis via localization; persistence and its associated "dynamic" 
arguments are motivated and defined. Section 6 describes NIP, simplicity and stability in 
terms of persistence. 

Acknowledgments. This is work from my doctoral thesis at Berkeley under the direction of 
Thomas Scanlon. I would also like to thank Leo Harrington for many interesting discussions 
and John Baldwin for helpful remarks on an early version. Part of this work was done as a 
Mathlogaps fellow in Lyon and I thank the Equipe de Logique for their warm hospitality. 

2. The characteristic sequence 

Definition 2.1. (Notation and conventions, I) 

(1) Throughout this article, if a variable or a tuple is written x or a rather than x, a, this 
does not necessarily imply that £(x),£(a) = 1. 

(2) Unless otherwise stated, T is a complete theory in the language C. 

(3) A graph in which no two elements are connected is called an empty graph. A pair 
of elements which are not connected is an empty pair. When R is an n-ary edge 
relation, to say that some X is an i?-empty graph means that R does not hold on 
any n-tuple of elements of X. X is an incomplete graph if R holds on every n-tuple 
from X. 

(4) Important: (p n (x; y±, ■ ■ ■ y n ) denotes the formula f\ i<n <p{x; yi). 

(5) In discussing graphs we will typically write concatenation for union, i.e. Ac for 
AU{c}. 

(6) A formula ip(x;y) of £ will be called dividable if there exists an infinite set C C P\ 
and k < uj such that {ip{x;c) : c G C} is 1-consistent but k -inconsistent. (Thus, by 
compactness, some instance of ip divides.) When it is important to specify the arity 
k, write fc-dividable. 

(7) A set is /c-consistent if every k-element subset is consistent, and it is ^-inconsistent 
if every k-element subset is inconsistent. 

To each formula <p we associate a countable sequence of hypergraphs, the characteristic 
sequence, which describe incidence relations on the parameter space of ip. The idea is to give 
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an analysis of yj-types by describing the way that certain distinguished sets A (the complete 
Poo-graphs, avatars of consistent partial <p- types) sit inside the ambient hypergraphs P n . 

Definition 2.2. (Characteristic sequences) Let T be a first-order theory and <p a formula of 
the language ofT. 

• Forn < uj, P n (z 1 , . . . z n ) := 3x /\^ n (p(x; Zi) . 

• The characteristic sequence of if in T is (P n : n < uj). 

• Write (T, cp) \— > (P n ) for this association. 

• Convention: we assume that T h \/y3z\/x((p(x; z) -i<p(x;y)). If this does not 
already hold for some given <p, replace <p with 8(x; y, z) = <p(x; y) A ->(p(x; z). 

Convention 2.3. Below, we will ask a series of questions about whether certain, possibly 
infinite, configurations appear as subgraphs of the P n , or of the Pj[ in some finite localization, 
Definition 15.11 For our purposes, the existence of these configurations is a property of T. 
That is, we may, as a way of speaking, ask if some configuration X appears, or is persistent, 
inside of some P n ; however, we will always mean whether or not it is consistent with T 
that there are witnesses to X inside of P n interpreted in some sufficiently saturated model. 
Certainly, one could ask the question of whether some given model of T, expanded to model 
of the P n , must include witnesses to X; we will not do so here. Thus, the formulas P n will 
often w.l.o.g. be identified with their interpretations in some monster model. 

Definition 2.4. (Notation and conventions, II) 

(8) Pqo will be shorthand for the collection of predicates P n when the context (of a given 
condition, not necessarily definable, which holds of P n for all n) is clear, e.g. A is a 
Poo-complete graph meaning A is a P n -complete graph for all n. 

(9) The complete P^- graph A will be called a positive base set when the emphasis is on 
its identification with some consistent partial ip-type under analysis, as described in 
Observation \2.3i 5). 

(10) The sequence (P n ) has support k if: P n (yi, . . . y n ) iff Pt holds on every k-element 
subset of {yi, . . . y n }. See Remark UTR 

(11) The element a e Pi is a one-point extension of the P n -complete graph A just in case 
Aa is also a P n -complete graph. In most cases, n will be oo. 

Observation 2.5. (Basic properties) Let (P n : n < uj) be the characteristic sequence of 
(T,<f). Then, regardless of the choice ofT and tp, we will have: 

(1) (Reflexivity) \/x(Pi(x) — > P„(x, . . . x)). In general, for each £ < m < to, 




(2) (Symmetry) For any n < uj and any bisection g : n — > n, 
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(3) (Monotonicity) For each i < m < to, 

\/z 1 ,...z e ,y 1 ,...y m ( ({z u ...z £ }C { Vlj . . . y m }) 

{PrniVU- ■ -Vm) Pl{zi, ■ ■ ■ Z t )) J 

So m particular, if \= P m (yi, . . . y m ) and £ < m then Pi holds on all l-element subsets 
of {yx, . . . y m }- The converse is usually not true; see Remark [Ol 

(4) (Dividing) Suppose that for some n < uj, it is consistent with T that there exists an 
infinite subset Y C P n such that Y k D P n k = 0- Then in any sufficiently saturated 
model ofT, some instance of the formula ip n (x; yi, . . . y n ) = f\ i<n <f{x; yi) k-divides. 

(5) (Consistent types) Let A C P\ be a set of parameters in some M |= T. Then 
{<p(x; a) : a G A} is a consistent partial tp-type iff A n C P n for all n < u. 

Proof. (4) By compactness, there exists an infinite indiscernible sequence of n-tuples C = 
(c\, . . . c l n : i < uj) such that C k fl P n k = 0. The set {f n (x; c\, . . . d n ) : i < u>} is therefore 
fc-inconsistent. However, it is 1-consistent: for each c\,...c l n G C, M \= P n (c\, . . . c l n ), so 
M \= 3x(f n (x; c\, . . . ci). " □ 

Convention 2.6. (T -configurations) Throughout this article, let T denote the incomplete 
theory in the language £q := {P n : n < u} U {=} which describes (l)-(3) of Observation \2.5\ 
Blueprints for hypergraphs in the language Cq which are consistent with T Q will be called T - 
configurations. That is: a finite T -configuration is a pair X = (Vx, Ex) where Vx = n < uj, 
Ex Q V(n) and the following is consistent with Tq: 

(1) (3x u . . .x n ) (Vo- C n, \a\ = i,a = {£ 1} . . .^}) \Pi(x ei , . . . x ti ) ^> a G E x 

In general, the domain of a Tq- configuration may be infinite; we simply require that its 
restriction to every finite subdomain satisfy ([!]). These are the graphs which can consistenly 
occur as finite subgraphs of some characteristic sequence. That every such graph appears in 
some sequence follows from Example \3.S\ below. 

Convention 2.7. (Ti-configurations) Fix T,(p, and the associated sequence (P n : n < uj). 
Let M |= T; there is a unique expansion of M to Cq = {P n : n < uj} U {=}• Throughout 
this article, whenever T,(p, (P n ) are thus fixed, letTi denote the complete theory of M in the 
language Cq. As the characteristic sequence is definable in T, when T is complete this will 
not depend on the model chosen. 

Hypergraphs in the language Cq which are consistent with T\ will be called Ti-configurations. 

Recall that a formula f(x;y) has the finite cover property if for arbitrarily large n < uj 
there exist a , . . . a n such that {y?(x; ao), ■ ■ ■ <p(x; a n )} is n-consistent but {n + l)-inconsistent. 

Remark 2.8. The following are equivalent, for (T,<f) (P n ): 

(1) There is k < uj such that the sequence (P n ) has support k. 

(2) if does not have the finite cover property. 

In practice, when analyzing saturation of <y9-types the finite cover property can often, but 
not always, be avoided by a judicious choice of formula. For instance, if ip is unstable, 
some fixed finite conjunction 9 of instances of p has the finite cover property ([8] .11.4); if 
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we choose to present y?-types as ^-types the characteristic sequence would not have finite 
support. Nonetheless, it may happen even in unstable theories that there is a set £ C C of 
formulas without the fcp such that M \= T is A + -saturated iff M realizes all (/?o-types over 
sets of size A for all <p G E. This is true, for instance, of £ = {ip(x; y, z) := xRy A -^xRz} 
in the random graph, and of £ = {ip(x; y,z) := y < x < z} in (Q, <). 

3. Some examples 

This section works out several motivating examples. We refer informally to localization 
and persistence, which will be defined in Definitions mZl and [T5l below; the general definitions 
of (rj, i/)-arrays and trees will be given in Definition 14.21 

Example 3.1. (The random graph) 

T is the theory of the random graph, and R its binary edge relation. Let (p(x; y, z) = 
xRy A ~^xRz, with (T, (p) \— > (P n ). Then: 

• p i((y, z )) y^ z - 

• Pn((yi, zt),... (y n , z n )) ^> {y h ...y n }n {z 1 , . . . z n } = 0. 
Notice: 

(1) The sequence has support 2. 

(2) There is a uniform finite bound on the size of an empty graph C C Pi, C 2 n Pi = 0: 
an analysis of the theory shows that (p is not dividable, and inspection reveals this 
bound to be 3. 

(3) P n does not have the order property for any n and any partition of the y±, . . . y n into 
object and parameter variables. (Proof: The order property in P n implies dividability 
of (f2n by Observation 15.91 But none of the ipi are dividable, as inconsistency only 
comes from equality.) 

(4) Of course, the formula (p has the independence property in T. We can indeed find a 
configuration in P 2 which witnesses this: any C which models the T -configuration 
having Vx = oj and {i, j} ^ Ex 3n(i = 2n A j = 2n + 1). Note that ip will 
have the independence property on any infinite P2-complete subgraph of the so-called 
(u, 2)-array C (see Observation 16. 1 II below) . 

(5) As <p is unstable, y?-types are not necessarily definable in the sense of stability theory. 
However, we can obtain a kind of definability "modulo" the independence property, 
or more precisely, definability over the name for a maximal consistent subset of an 
(u, 2)-array as follows: 

Definable types modulo independence. Let p G S(M) be a consistent partial y?-type presented 
as a positive base set A C Pi. Let us suppose p h {xRc : c G C} U {^xRd : d G D} h p, so 
that A C M 2 is a collection of pairs of the form (c, d) which generate the type. 

There is no definable (in T with or without parameters, so in particular not from P 2 ) 
extension of the type A, so we cannot expect to find a localization of Pi around A which is 
a P2-complete graph. However: 

Claim 3.2. In the theory of the random graph, with (p(x; y, z) = xRy A ->xRz as above, for 
any positive base set A C Pi there exist a definable (u, 2) -array W C Pi, a solution SofW 
and an S-definable Poo-graph containing A. 
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Proof. Work in Pi. Fix any element (a, b) with a, b ^ C, P> and set Wq := {(y,^) G -Pi : 
-.P 2 ((j/,z),(o,6))}. Thus l^o = {(&,*) : 2 ^ 6} U {(y,o) : y ^ a}. So the only P 2 - 
inconsistency among elements of Wq comes from pairs of the form (b, c), (c, a); thus, writing 
Greek letters for the elements of Pi, 

(V77 g vy )(3z/ g W )(VC g W„) (-P 2 (r/,C) - C = 

In other words, := \ {(^; a )} is an (^,2)-array (Definition 14. 2p . Moreover: 

(1) (y, z), (w, v) EW and -iP 2 ((y, z), (it?, f )) implies y = v or z = w, and 

(2) for any c ^ a, 6, there are d,e E M such that (d, c), (c, e) G W 7 . Thus: 

(3) we may choose a maximal complete P 2 -subgraph C of W such that is a complete 
Poo-graph. For instance, let C be any maximal complete extension of {(&, d) : d G 
.0} U {(c, a) :cGC}. Call any such C a solution of the array W. 

Let 5 be a new predicate which names this solution C of W. Then {y e Pi : z G S 1 — > 
P2(y,z)} D A is a P2-complete graph, definable in C U {S}. Support 2 implies that it is a 
Poo-graph. Notice that by (2), we have in fact chosen a maximal consistent extension of A 
(i.e. a complete global type). □ 

Remark 3.3. The idiosyncracies of this proof, e.g. the choice of a definable (u,2)-array, 
reflect an interest in structure which will be preserved in ultrapowers. 

Example 3.4. (Coding complexity into the sequence) 

It is often possible to choose a formula (p so that some particular configuration appears 
in its characteristic sequence. For instance, by applying the template below when (p has 
the independence property, we may choose a simple unstable 9 whose P 2 is universal for 
finite bipartite graphs (X, Y), provided we do not specify whether or not edges hold between 
x,x' G X or between y, y' G Y. Nonetheless, Conclusion 15.101 below will show this is 
"inessential" structure in the case of simple theories: whatever complexity was added through 
coding can be removed through localization. 

The construction. Fix a formula <p of T. Let 9(x; y,z,w) := (z = w A x = y) V (z ^ 
w A (p(x; y)). Write (y, *) for (y, z, w) when z — w, and (y, — ) for (y, z, w) when z ^ w. Let 
(P n ) be the characteristic sequence of 9, (P%) be the characteristic sequence of ip, and (P^) 
be the characteristic sequence of x = y. Then P n can be described as follows: 

• P n ((yi, -),... (y n , -)) <-> PJ(y u ...y n ). 

• Pn((Vl, *),••• (Vn, *)) Pn (Vl> ■ ■ -Vn)- 

• Otherwise, the n-tuple y := ((yi, Zi), . . . (y n , z n )) can contain (up to repetition) at 
most one *-pair, so Zi = Zj = * —>■ yi = yj. In this case the unique y* in the *-pair is 
the realization of some y-type in the original model M of T, and 

Pn+i((y*, *), (yi, -),... (yn, -)) holds iff M |= Aj<„ <p(y*; 

Remark 3.5. This highlights an important distinction: the fact that a characteristic sequence 
may contain a bipartite graph is not anywhere near as powerful as the fact of containing a 
random graph, see Example \3.S\ below. In the coding just given we could not choose how 
elements within each side of the graph interrelated. This is quite restrictive, and eludes our 
coding for deep reasons: for instance, applying a consistency result of Shelah on the Keisler 
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order one can show that the order property in the characteristic sequence cannot imply the 
compatible order property in the characteristic sequence, Definition ^ -S\ below [TJ. 

Example 3.6. (A theory with TP 2 ) 

TP 2 is Shelah's tree property of the second kind, to be defined and discussed in detail in 
Definition 14.11 Let T be the model completion of the following theory [12J. There are two 
infinite sorts X,Y and a single parametrized equivalence relation E x (y,z), where x G X, 
and y, z 6 Y. Let ip eq := (p(y; xzw) = E x (y, z) A ->E x {z, w). Then: 

• Pi((xzw)) -<==>- z 7^ w. 

• PziixiZiWi), (x 2 z 2 w 2 )) -<=>- each triple is in Pi and furthermore: 

(xt =x 2 )-+( E x (z u z 2 ) A f\ ~<E x (wi, Zj)) 

The sequence has support 2. There are many empty graphs; these persist under local- 
ization (Theorem 16.24)) . One way to see the trace of TP2 is as follows. Fixing a, choose 
aj {% < uj) to be a set of representatives of equivalence classes in E a , and choose b such that 
-iE a (cii,b) (i < uj). Then {(a,ai,b) : i < uj} C Pi is a P2-empty graph. We in fact have 
arrays {(a*, a\, 6') : i < u,t < 00} whose "columns" (fixing t) are P2-empty graphs and where 
every path which chooses exactly one element from each column is a P2-complete graph, 
thus a Poo-complete graph. The parameters in this so-called (u, cu)-array describe TP 2 for 
ip eq (Claim H2]) . 

Note that a gap has appeared between the classification-theoretic complexity of </?, which 
is not simple, and that of the formula P2: 

Claim 3.7. P2 does not have the order property. 

Proof. This is essentially because inconsistency requires the parameters x to coincide. Sup- 
pose that (ai, bi : i < u) were a witness to the order property for P 2 . Fix any a« = (a s , a s , d s ). 
Now -iP 2 (bj, di) for j < i, where bj = (ft t , h, c t ). P 2 -inconsistency requires a s = (3 t . As this is 
uniformly true, a s = a t = /3 S = (3 t for all s, t < uj in the sequence. But now that we are in a 
single equivalence relation E a , transitivity effectively blocks order: -*P 2 (bj, aj) <-> -^E a (a s , b t ). 
Depending on whether at least one of the a- or 6-sequences is an empty graph, we can find 
a contradiction to the order property with either three or four elements. □ 

Example 3.8. (A maximally complicated theory) 

In this example the sequence is universal for finite T -configurations (Convention 12. 6p . a 
natural sufficient condition for "maximal complexity." 

Let the elements of M be all finite subsets of uj; the language has two binary relations, C 
and =, with the natural interpretation. Set T = Th(M). 

Choose ipc '■= <f{x; y, z) = x C y A x % z. Then: 

• Pi((y,z)) ^ ®c y % z . 

• P„((2/l,2l),...(j/n,^)) ^ £ (\< n yi £ Ui<n Z i- 

The sequence does not have finite support. Moreover: 

Claim 3.9. Let (P n ) be the characteristic sequence of <£>c, k < uj, and let X be a finite 
Tq- configuration. Then there exists a finite A C P x witnessing X . 
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Proof. Write the elements of Pi as Wi = (yi,Zi); it suffices to choose the positive pieces \ji 
first, and afterwards take the Z\ to be completely disjoint. More precisely, suppose X is given 
by Vx = m and Ex C V(m). We need simply to choose yi, . . . y m such that for all a Cm, 




which again, is possible by the downward closure of Ex- D 

Corollary 3.10. This characteristic sequence is universal for finite Tq- configurations. 

Remark 3.11. That the sequence is universal for finite T - configurations is sufficient, though 
not necessary, for maximal complexity in the Keisler order. By ^\.VI.3, <f(x;y,z) = y < 
x < z in Th(Q, <) is maximal. Its characteristic sequence has support 2, but its P2 is clearly 
not universal. 

4. Static configurations 

This section establishes a series of correspondences between Ti-configurations found in 
the characteristic sequence of f and the classification-theoretic complexity of if itself. It 
lays the groundwork for the next section, which will build "dynamic" arguments out of 
these "static" ones by asking what happens when certain configurations persist under all 
reasonable restrictions of the set P\. 

Here we describe configurations which signal that if has the order property, the indepen- 
dence property, the tree property and SOP 2 . Recall that: 

Definition 4.1. (Tree properties) Let C indicate initial segment. To simplify notation, say 
that the nodes pi, P2 G oj <uj are "Incomparable if 

->(pi Q P2) A ^(p 2 C p x ) A ->(3u G uj <UJ ,i,j G io)(pi = v~i,p 2 = u~j) 

i.e., if they do not lie along the same branch and are not immediate successors of the same 
node. 

Then the formula if has: 

• the fc-tree property, where k < uj, if there is an uj <w -tree of instances of if where paths 
are consistent and the immediate successors of any given node are k-inconsistent, i.e. 
X = {(p(x; a v ) : r\ G oj <U) }, and: 

(1) for all v G uj u , {<f{x; a v ) : 77 C u} is a consistent partial type; 

(2) for all p G u <UJ , {f(x; a p ~i) : i < uj} is k-inconsistent. 
Call any such X a ip-tree, or if necessary a ip-k-tree. 

• the tree property if it has the k-tree property for some 2 < k < uj. 

• the non-strict tree property TP2 if there exists a if -tree with k = 2 and for which, 
moreover: 

(3)2 for any two * incomparable p\,p 2 G u <UJ , 3x(ip(x; a pi ) A <f(x; a P2 )). 

• the strict tree property, also known as TP\ or SOP2, if there exists a ip-tree with 
k = 2 and for which, moreover: 

(3)i for any two * incomparable pi,p2 G u <UJ , -i3x((p(x; a Pl ) A <f(x; a P2 j). 

Theorem A. (Shelah; see gJ.III.7j 

• T is simple iff no formula if ofT has the tree property, iff no if has the 2-tree property. 
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• If cp has the 2-tree property then either tp has TP\ or (p has TP 2 . 

We fix a monster model M from which the parameters are drawn; see Convention 12.31 

Definition 4.2. (Diagrams, arrays, trees) Let A > \i be finite cardinals or to. Write C to 
indicate initial segment. The sequence (P n ) has: 

(1) an (u, 2)-diagram if there exist elements {a v : 77 G 2 <UJ } C Pi such that 

• for all 1] G 2 <UJ , -iP 2 (ar ) -0; °"q~l)} an d 

• for all n < u and rji, ... r\ n G 2 <UJ , we have that 771 C ••• C r\ n P n (a m , . . . a Vn ) 
That is, sets of pairwise comparable elements are Poo-consistent, while immediate 
successors of the same node are P2-inconsistent. 

(2) a (A, /1, l)-array if there exists X = {a™ : / < A, m < ji} C P\ such that: 

• P 2 (a™\a™*) ^ (h = l 2 ^ mi = m 2 ) 

• For all i < ijj, 

P n {al\...a- : ) <=► j\ P 2 (a™%a™<) 

l<i,j<n 

That is, any C C X , possibly infinite, is a P^-graph iff it contains no more than 
one element from each column. (We will relax this last condition in the more general 
Definition \6.9\ below.) 

(3) a (A,yu)-tree if there exist elements {a v : 77 G /i <A } C P± such that 

• for all 772, 772 £ H <X , 

P 2 (a v , a v ) (771 C 772 V ?7 2 C rix) 

i.e. only if the nodes are comparable; and 

• for all n < u, r]i, . . . r\ n G /i <A , 

Remark 4.3. Diagrams are prototypes which can give rise to either arrays or trees, in the 
case where the unstable formula (p has the independence property or SOP 2 , respectively. 
The arrays will be revisited in Definitions \KT\ and \6'.y\ . 

Claim 4.4. Let (p be a formula of T and set 9(x; y, z) = <p(x; y) A -«p(x; z) . Let (P n ) be the 
characteristic sequence of (T, 6). The following are equivalent: 

(1) (P n ) has an (u, 2) -diagram. 

(2) R(x = x, <p(x; y), 2) > u, i.e. ip is unstable. 

(3) R(x = x, 6(x; yz), 2) > uo, i.e. 9 is unstable. 

Proof. (2) — > (1): We have in hand a tree of partial y?-types 71 = {p u : v G 2 W }, partially 
ordered by inclusion, witnessing that R{x = x, ip, 2) > uj. Let us show that we can build an 
(u, 2)-diagram. That is, we shall choose parameters {a v : 77 G 2 <UJ } C Pi satisfying Definition 

Mi). 

First, by the definition of the rank R, which requires the partial types to be explicitly 
contradictory, we can associate to each v an element c v G M, £(c u ) = £(y) such that: 

• <p{x;c v ) G p v ~i \p v , and 

• ^(p(x;c u ) G p v ~ \p v - 
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i.e., the split after index v is explained by <p(x; c u ). 
Second, choose a set of indices S C 2 <w such that: 

• (yr] G 2 <UJ ) (3s ES)(r]C s) 

• (Vsi C s 2 G S) (377 i S) (si C 77 C Sa ) 

It will suffice to define a s ~j for s & S, i & {0, 1}. (The sparseness of S ensures the chosen 
parameters for <p won't overlap, which will make renumbering straightforward.) Recall that 
the a v will be parameters for 6(x\ y, z) = <p(x; y) A ->{x\ z). So we define: 

• a s~l = ( c si <V>i). 

The consistency of the paths through our (ou, 2)-diagram is inherited from the tree 1Z of 
consistent partial types. However, -nP 2 (a s ~ , a s~i) because these contain an explicit contra- 
diction: 

~^3x ((<p(x; c s ~ ) A -Kp(x; c s )) A ((p(x\ c s ) A ->((p(x; c^U 

(1) — > (3): Reading off the parameters from the diagram we obtain a tree of consis- 
tent partial 6*-types {p v : r\ G 2 <UJ }, partially ordered by inclusion. For any r\ G 2 <UJ , 
->P 2 ( a rrCb a r)-i)i i- e - _| 3x(6'(x; a rr ) A 9(x; a^-i)). Furthermore, 9(x; a^o) G p^- \ P??; while 
9(x; a^i) G p^-x \ p v . So there is no harm in making the types explicitly inconsistent, as the 
rank R requires, by adding -i 9(x; a„-J to p v ~j for i ^ j < 2. 

(2) ^ (3): for all A, \A\ > 2, \S V (A)\ = \S e (A)\. □ 

Claim 4.5. Let (p be a formula ofT and set 9(x; y, z) = (p(x\ y) A ~i(p(x\ z). Let (P n ) be the 
characteristic sequence of(T,9). The following are equivalent: 

(1) (P n ) has an (uj, 2,1) -array. 

(2) ip has the independence property. 

(3) 9 has the independence property. 

Proof. (1) — > (3): This is Observation 16 . 1 ll (Essentially, let Aq be the top row of the array 
A, and a, t C A finite disjoint; let B C A be a maximal positive base set, i.e. a maximal 
Poo-complete graph, in A containing a and avoiding r. Then any realization of the type 
corresponding to B is a witness to this instance of independence.) 

(2) — > (1): Let (it : £ < uj) be a sequence over which ip has the independence property. For 
t < 2, j < UJ set a Q - = (i£,ii + i), a 1 - = (z^ +1 ,^). Then {a* : t < 2,j < uj} is an (u,2, l)-array 
for Poo. 

(3) — > (2): For any infinite A, \Stp(A)\ = \Sg(A)\, as any type on one side can be presented 
as a type on the other. The independence property can be characterized in terms of the 
cardinality of the space of types over finite sets ([8] Theorem II. 4. 11). □ 

Claim 4.6. Let ip be a formula ofT and set 9(x; y, z) = ip(x; y) A -^<p(x; z). Let (P n ) be the 
characteristic sequence of (T,9). Suppose that T does not have SOP2- Then the following 
are equivalent: 

(1) (P n ) has an (u,u),l)- array. 

(2) (p has the 2-tree property. 

Proof. (1) — > (2) Each column (=empty graph) of the array witnesses that ip is 2-dividable, 
and the condition that any subset of the array containing no more than one element from 
each column is a Poo-complete graph ensures that the dividing can happen sequentially. 
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(2) — > (1) By Theorem Kl above. NSOP2 implies tp has TP 2 . That is, there is a tree of 
instances {(p(x; a v ) : 77 G u; <w } such that first, for any finite n, r\\ C • • • C r]„ implies that 
the partial type {^(x; a m ), . . . ip(x] a Vn )} is consistent; and second, 

-3x yip(x] a v ) A (p(x; a u )^j (3p G c<j <a; )(3i 7^ j G uS) (r] = p^i A v = p^j^j 

Thus the parameters {a v : 77 G w <u } C Pi form an (uj, uj, l)-array for P^. □ 

It is straightforward to characterize the analogous k-tree properties in terms of arrays 
whose columns are ^-consistent but (k + l)-inconsistent. 

Claim 4.7. The following are equivalent: 

(1) (P n ) has an (uj,2)-tree. 

(2) y> has SOP 2 . 

Proof. (2) — > (1) This is a direct translation of Definition 14. 1[ 

(1) — > (2) It suffices to show that (P n ) has an (u, u;)-tree, which is true by compactness, 
using the strictness of the tree. □ 

In the next definition, the power of the classification-theoretic order property on the P n 
is magnified when it can be taken to describe the interaction between complete graphs, i.e. 
base sets for partial <^-types. Compare Remark 13.51 

Definition 4.8. P^ has the compatible order property if there exists a sequence C = (a*, hi : 
i < uj) C P\ such that for any n < uj and any a±, b\, . . . a n , b n C C , 

Pniiauh),. . .(a n ,b n )) (max{a 1; . . .a n } < min{6i, . . .&„}) 

Say that P m has the compatible order property to indicate that this holds for all P n , n < m. 

Observation 4.9. Suppose (T,ip) 1— > (P n ), and that (P n ) has the compatible order property. 
Then ifi2 has the tree property, and in particular, SOP2- 

Proof. Let us build an SOP2-tree {v?2(x; o^, b v ) : 77 G uj <uj } following Definition 14. II above by 
specifying the corresponding tree of parameters {c v : 77 G uj <uj } C Pi, where each c v is a pair 
(a v , b v ). Let S = (afti : i < Q) be an indiscernible sequence witnessing the compatible order 
property. We will use two facts in our construction: 

(1) Let (difij^ : £ < uj) be any subsequence of S such that t < k ==>- a it < bj e < ai k < 
bj k . Then {</) 2 (x; a i;) ., 6^.) : j < uj} 2-divides by Observation 15.71 

(2) Let a h ,b h ,. . . a in ,b jn G S. Then 

p n((a il ,b jl ),...(a in ,b jn )) max{ii, . . . i n } < min{ji, . . . j n } 

so in particular 

P2((ai 1 ,b jl ),(a i2 ,b j2 )) max{i 1 ,i 2 } < mw.{j 1} j 2 } 

Let 77 G uj <uj be given and suppose that either c v has been defined or 77 = 0. If c v has 
been defined, it will be (a i5 bj) for some i < j G Q. Let {kg : £ < u) be any cu-indexed subset 
of (i, j) n Q, or of Q if 77 = 0. Define = (a^, bk e+1 ). Now suppose we have defined the 
full tree of parameters c v in this way. By fact (1) we see that immediate successors of the 
same node are P 2 -inconsistent. By fact (2) n , paths are consistent, while by fact (2) 2 , any 
two ""incomparable (Definition 14.11) elements c u ,c v are P2-inconsistent. □ 
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Remark 4.10. The compatible order property in P^ is in fact enough to imply maximality 
in the Keisler order [7] . 

5. Localization and persistence 

The goal of these methods is to analyze <£>-types, and thus to concentrate on the combina- 
torial structure which is "close to" or "inseparable from" the complete graph A representing 
a consistent partial y?-type under analysis. Localization and persistence, defined in this 
section, are tools for honing in on this essential structure. 

P n asks about incidence relations on a set of parameters; it will be useful to definably 
restrict the witness and parameter sets. For instance: 

• we may ask that the witnesses lie inside certain instances of <p, e.g. by setting 
P{(y) = 3x(<p(x; y) A ip(x; a)), i.e. P[ = P 2 (y, a). 

• we may ask that the parameters be consistent 1-point extensions (in the sense of 
some P n ) of certain finite graphs C. For instance, we might define P"(y) = Pi{y) A 
P 2 (y,c 1 ) A P 3 (y,c 2 ,c 2 ). 

The next definition gives the general form. 

Definition 5.1. (Localization) Fix a characteristic sequence (T,(p) — ► (P n ), and choose 
B,AcM\=T with A a positive base set and A = possible. 

(1) (the localized predicate P[) A localization PJ[ of the predicate P n (yi, ■ ■ -y n ) around 
the positive base set A with parameters from B is given by a finite sequence of triples 
f : m — > to x Vtt (yi, . . . y n ) x V^ (B) where m < uj and: 

• writing f(i) = (r^, <jj, /%) and s for the elements of the set s, we have: 

P£( yi ,...y n ):= /\ P ri {diJi) 

• for each £ < uj, T\ implies that there exists a P^-complete graph Ci such that P£ 
holds on all n-tuples from CV If this last condition does not hold, P£ is a trivial 
localization. By localization we will always mean non-trivial localization. 

• In any model o/Ti containing A and B, P£ holds on all n-tuples from A. 

Write Locf (A) for the set of localizations of P n around A with parameters from B 
(i.e. nontrivial localizations, even when A — 0). 

(2) (the localized formula ipf) For each localization P? of some predicate P n in the char- 
acteristic sequence of (p, define the corresponding formula 

y f n(x; •• • Vn) ■= Vni.x; y u ... y n ) A Pj[(yi, ■■■y n ) 

When n = 1, write ^ = (f{. Let S^(N) denote the set of types p G S lp (N) such 
that for all {f^^x; qj, . . . ip% n {x\ c in )} C p, P/(cj i; . . . c in ). Then there is a natural 
correspondence between the sets of types 

SfrN) «- S vf (N) 
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(3) (the *localized formula ipf +a ) We have thus far described localizations of the param- 
eters of ip. We will also want to consider restrictions of the possible witnesses to ip 
by adjoining instances ofip k - That is, set 

pf+«(x; y) = v f+ a ^ (x; y) := <p(x; y) A P[{y) A <p k (x; a u ...a k ) 

where, as indicated, k = £(a). The * is to emphasize that this is really the construction 
from ip of a new, though related, formula, which will have its own characteristic 
sequence, given by: 

(4) (the *localized characteristic sequence (Pl +a : n < to)) The sequence (Pl +a : n < uo) 
associated to the formula ipf +a is given by, for each n < uj, 

p i +a (yi, ■ ■ -Vn) = A p i(Vi) A p n+kiyu ...y n ,a,i,...a h ) 

i<n 

When fora are empty, we will omit them. 

Remark 5.2. Convention \2.3\ applies: that is, localization is not essentially dependent on 
the choice of model M. See Definition 5.1J\ (Persistence) and the observation following. 



As a first example of the utility of localization, notice that when <p is simple we can localize 
to avoid infinite empty graphs. 

Observation 5.3. Fix a positive base set A for the formula ip, possibly empty. When ip does 
not have the tree property, then for each k < uj there is a finite set C over which ip is not 
k-dividable. As a consequence, if ip does not have the tree property, then for each predicate 
P n there is a localization around A on which there is a uniform finite bound on the size of 
a P n - empty graph. We can clearly also choose the localizations so that none of ipi , ■ ■ ■ i>i are 
k-dividable for any finite k, £ fixed in advance. 

Proof. This is the proof that D{x = x, ip, k) < u for any simple formula ip; see for instance 
|14j . (For a more direct argument, see the proof of Conclusion 15. 101 below.) □ 

The following important property of formulas was isolated by Buechler pp. 

Definition 5.4. The formula ip is low if there exists k < u such that for every instance 
ip(x;a) of ip, ip(x;a) divides iff it < k-divides. 

Observation 5.5. If (p does not have the independence property then ip is low. 

Proof. To show that any non-low formula ip has the independence property, it suffices to 
establish the consistency of the following schema. For k < u, ty k says that there there exist 
yii - ■ ■ V2k such that for every o C 2k, \o~\ = k, 



3x [ip(x; yi) <^> i G a 



But will be true on any subset of size 2k of an indiscernible sequence on which ip is 
/^-consistent but (k + l)-inconsistent, and such sequences exist for arbitrarily large k by 
hypothesis of non-lowness. □ 

Corollary 5.6. When the formula ip of Observation 15.31 is simple and low, we can find a 
localization in which ip is not k-dividable, for any k. 
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5.1. Stability in the parameter space. The classification-theoretic complexity of the 
formulas P n is often strictly less than that of the original theory T. Note that the results 
here refer to the formulas P n (yi, ■ ■ ■ Vn), riot necessarily to their full theory T\. 

Observation 5.7. Suppose (T,ip) i— > (P n ) ■ If Piix^y) has the order property then ip(x;y) A 
<p(x; z) is 2-dividable. 

Proof. Let (a*, b{ : % < uj) be a sequence witnessing the order property for P2, so P2(cn, bj) iff 
i < j. This means that 3x((p(x; aA A ip(x; bj)) iff i < j. So <£>(x; ai) A </?(x; are consistent 
for each i, but the set {<p(x; Oj) A &j+i) : i < uj} is 2-inconsistent. □ 



Remark 5.8. By compactness, without loss of generality the sequence of Observation 5.1 can 
be chosen to be (T— ) in discernible, and so actually witnesses the dividing of some instance 

of '-pi- 
Note that the converse of Observation 5.1 fails: for tp(x;y)A(p(x; z) to divide it is sufficient 
to have a disjoint sequence of "matchsticks" in P 2 (i.e. (aj,6j) : % < uj such that P2(ai,bj) iff 
i = j), without the additional consistency which the order property provides. 

Nonetheless, work relating the characteristic sequence to Szemeredi regularity illuminates 
the role of the order [7J. 

Observation 5.9. Suppose that (T,tp) 1— > (P n ), and for some n,k and some partition of 
yi, . . . y n into k object and (n-k) parameter variables, P n (yi, ■ ■ - yk] 2/fe+i) • • • Z/n) has the order 
property. Then (f n {x', yi, ■ ■ ■ y n ) is 2-dividable. 

Proof. The proof is analogous to that of Observation I5.7[ replacing the Oj by fc-tuples and 
the bj by (n-&)-tuples. □ 

Thus in cases where we can localize to avoid dividing of (p, we can assume any initial 
segment of the associated predicates P n are stable: 

Conclusion 5.10. For each formula if and for all m < uj, if </3 2n does not have the tree 
property, then for each positive base set A there are a finite B and P( £ Locf (A) over which 
P2, . ■ ■ P n do not have the order property. In particular, this holds ifT is simple. 

Proof. We proceed by asking: do there exist elements (y%Zi : % < uj) such that (1) each yiz-i 
is a 2-point extension of A and (2) {y%Zi : i < uj) witnesses the order property for P2I If not, 
localize using the finite set of conditions in (1) which prevent (2). Otherwise, let ax, 61 be 
the first pair in any such sequence, set A\ := A U {a 1; 6x} and repeat the argument using 
A\ in place of A. By simplicity, there is a uniform finite bound on the number of times 
ip n (see Observation 15. 9p can sequentially divide. Condition (1) ensures that the dividing 
is sequential, corresponding to choosing progressive forking extensions of the partial type 
corresponding to A. At some finite stage t this will stop, meaning that (1) and (2) fail with A t 
in place of A; the finite fragment of (1) which prevents (2) gives the desired localization. □ 

By way of motivating the next subsection, let us prove the contrapositive: If the order 
property in P2 persists under repeated localization, then ip has the tree property. Compare 
the proof of Observation 14.91 above. Without the compatible order property, we cannot 
ensure the tree is strict. While that argument built a tree out of a set of parameters which 
were given all at once (a so-called "static" argument), the following "dynamic" argument 
must constantly localize to find subsequent parameters, so cannot ensure that elements in 
different localizations are inconsistent. 
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Lemma 5.11. Suppose that in every localization of P\ (around A = fy), P 2 has the order 
property. Then ip 2 has the tree property. 

Proof. Let us describe a tree with nodes (c v ,d v ), (77 G uj <uj ), such that: 

(1) for each p G uj u , {c v , d v : r\ C p} is a complete P^-graph, where C means initial 
segment. 

(2) for any v G lj <u1 , P 2 {c^ il d^ j ) <==^ i < j. 

For the base case (77 G u 1 ), let (cj,<ij : i G oj) be an indiscernible sequence witnessing the 
order property (so P 2 (ci, df) -<=/- i < j) and assign the pair (q, <ij) to node i. 

For the inductive step, suppose we have defined (c v , d v ) for r\ G u n . Write E v = {(c u , d v ) : 
v < 7]} for the parameters used along the branch to (c v , d v ). Using x to mean the elements of 

f 

the set x, let P( v be given by P n+ i((y, z), E v ). Let (aj, bj : j G uo) be an indiscernible sequence 
witnessing the order property inside this localization, and define (c^, d v ~i) := (aj,bj). 

Finally, let us check that this tree of parameters witnesses the tree property for (p 2 - On 
one hand, the order property in P 2 ensures that for each n G uj <uj , the set 

{<f2(x; c ri , d^i) :ieu} 

is 1-consistent but 2- inconsistent. On the other hand, the way we constructed each local- 
ization P( v ensured that each path was a complete P^-graph, thus naturally a complete 
P^-graph, where (P^) is the characteristic sequence of the conjunction ip 2 . □ 

Remark 5.12. Example \3. 6i shows that the condition that (p has the tree property is neces- 
sary, but not sufficient, for the order property in P 2 to be persistent, Definition \5.1J\ below. 

Question 5.13. Is SOP 2 sufficient? 

Compare the issue of whether SOP 2 SOP 3 : see [2], [T2"] . 

5.2. Persistence. Localization, Definition 15.11 above . gives rise to a natural limit question: 
what happens when certain To-configurations persist under all finite localizations? 

Definition 5.14. (Persistence) Fix (T,tp) 1— > (P n ) , M \= T sufficiently saturated, and a 
positive base set A, possibly 0. Let X be a Tq- configuration, possibly infinite. Then X is 
persistent around the positive base set A if for all finite B C M and for all P( G Locf(A), 
Pf contains witnesses for X . 

We will write X is A-persistent to indicate that X is persistent around A. 

Note 5.15. Persistence asks whether all finite localizations around A contain witnesses for 
some T - configuration X. The predicates P n mentioned in X are, however, not the localized 
versions. We have simply restricted the set from which witnesses can be drawn. This is an 
obvious but important point: for instance, in the proof of Lemma [6.61 below it is important 
that the sequence of P2-inconsistent pairs found inside of successive localizations P/ n are 
Princonsistent in the sense of T\. 

Observation 5.16. (Persistence is a property of the theory T) The following are equiv- 
alent, fixing T,if,(P n ), A C Ai a small positive base set in the monster model, and a 
Tq- configuration X . Write P[{M) for the set which P( defines in the model M . 

(1) In some sufficiently saturated model M |= T\ which contains A, X is persistent around 
A in M . That is, for every finite B C M and every localization P( G Locf (A), there 
exist witnesses to X in P/(M). 
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(2) In every model N \= T\, N D A, for every finite B C N , every localization P( G 
Locf(A), and every finite fragment X of X, P((N) contains witnesses for X . 

Proof. (2) (1) Compactness. 

(1) =>- (2) Suppose not, letting P[ G Locf(A) and X witness this. To this P[ we can 
associate a Ti-type p(y%, ■ ■ -y\B\) G S(A) which says that the localization given by / with 
parameters yi, . . -y\B\ contains A but implies that X is inconsistent. But any sufficiently 
saturated model containing A will realize this type, and thus contain such a localization. □ 

To reiterate Convention 12.31 then, we may, as a way of speaking, call a configuration 
"persistent" while working in some fixed sufficiently saturated model, but we always refer to 
the corresponding property of T. 

Corollary 5.17. Persistence around the positive base set A remains a property ofT in the 
language with constants for A. 

Finally, let us check the (easy) fact that persistence of some T -configuration around 
in some given sequence (P n ) implies its persistence around any positive base set A for that 
sequence. Recall that all localizations are, by definition, non-trivial. 

Fact 5.18. Suppose that X is an -persistent T - configuration in the characteristic sequence 
(P n ) and A is a positive base set for (P n ) . Then X remains persistent around A. 

Proof. Let p(xo, . . . ) in the language C(=, Pi, P2, • • • ) describe the type, in Vx-many vari- 
ables, of the configuration X = (Vx,Ex)- Let q(y) G S(A) be the type of a 1-point Poo- 
extension of A in the language £0 = {Pn '■ n < uj} U {=}. We would like to know that 
q(xo), q(xi), . . . ,p(x , . . . ) is consistent, i.e., that we can find, in some given localization, 
witnesses for X from among the elements which consistently extend A. If not, for some 
finite subset A' C A, some n < uj, and some finite fragments q' of q\A> and p' of p, 

q'(x ) U ■ ■ ■ U q'(x n ) h ->p'(x , ...x n ) 

But now localizing Pi according to the conditions on the lefthand side (which are all positive 
conditions involving the P n and finitely many parameters A) shows that X is not persistent, 
contradiction. □ 

6. Dividing lines: Stability, Simplicity, NIP 

The first natural question for persistence is: given n, when isn't it possible to localize 
Pi so that P n is a complete graph? The answer, surprisingly, is: in the presence of the 
independence property. This section gives the argument, using the language of persistence 
to give a new description of NIP and of simplicity, Theorem 16.191 and Theorem 16.241 below. 
Recall that a theory T is NIP [10] if no formula of T has the independence property; for 
more on this hypothesis, see |13j . 

6.1. NIP: the case of P2. We will see that if if is NIP then we can localize around any 
fixed positive base set so that P2 is a complete graph. 

The argument in this technically simpler case will generalize without too much difficulty. 
We first revisit an avatar of the independence property. 

Definition 6.1. ((uj, 2)-arrays revisited) 
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(1) The predicate P n is (uj, 2) if there is C := {a\ : t < 2, i < uj} such that for all £ < n, 
any i-element subset Co, 



(2) If for all n < uj, P n is (uj, 2), we say that P^ is (uj, 2). 

(3) A path through the (uj,2)-array A is a set X C A which contains no more than one 
element from each column. So paths are positive base sets. 

Remark 6.2. If Pod is (uj,2), then (p has the independence property. 

Proof. Let X be a maximal path through the (uj, 2)-array A. Choose any o, r C X finite and 
disjoint. Let Y a%T C A be a maximal path such that o C Y a>T and Y ajT C\T = 0. Y a>T is a positive 
base set, so any element c realizing the corresponding <p-type will satisfy a G a — > <p(c; a) 
and 66r-> ^(c; fy- Thus <p has the independence property on X. □ 

Lemma 6.3. (Springboard lemma for 2) If ip is stable then there is a finite localization P( 
for which TFAE: 

(1) There exists X C P(, X an (u, 2) -array wrt P2 

(2) There exists Y C P( , Y an (uj, 2) -array wrt P^ 

Proof. Choose the localization P[ according to Observation 15.31 so that neither (p nor ^2 are 
dividable using parameters from P[. This is possible because stable formulas are simple and 
low, and (p stable implies y?2 stable. Let Z — (c\ : t < 2,i < uj) C P( be an indiscernible 
sequence of pairs which is an (tu, 2)-array for P 2 . Each of the sub-sequences (c° : i < uj), 
(c\ : % < uj) is indiscernible, so will be either P 2 -complete or P 2 -empty; by choice of P( , they 
cannot be empty. 

It remains to show that any path X C Z is a Poo-complete graph. Suppose not, and 
let n be minimal so that the n-type of some increasing sequence of elements z* 1 , . . . z* n 
implies -^3x(f\ i<n ip(x; Choose an infinite indiscernible subsequence of pairs Z' C Z 2 

of the form (c®,c} +1 : i G W C uj). Then the set {<p(x;c®) A v?(a;;c J 1 +1 ) : z G W 7 } will be 
1-consistent by definition but n-inconsistent by assumption (though not necessarily sharply 
n- inconsistent). This contradicts the assumption that (p 2 is not dividable in P(. □ 

When the formula is low but not necessarily simple, bootstrapping up to P^ is still possible 
but requires a stronger initial assumption on the array. 

Corollary 6.4. Suppose the formulas (p and (p 2 are low. Then there exists k < uj such that, 
in any localization P( , TFAE: 

(1) There exists X C P( , X an (uj, 2)-array wrt P k 

(2) There exists Y C Pj , Y an (ui,2)-array wrt P^ 

Proof. Let fco be a uniform finite bound on the arity of dividing of instances of <p and ip 2 , 
using lowness; by the proof of the previous Lemma, any k > 2k will do. □ 

Recall from Definition 12.11 that an "empty pair" is the To-configuration given by V x = 
2, E x = {{1}, {2}}, i.e., a pair y, z such that P\(y), Pi(z) but ~^P 2 (y, z). 

Lemma 6.5. Suppose <p is stable, and that every localization P( around some fixed positive 
base set A contains an empty pair. Then P^ is (uj,2). 
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Proof. Choose P/° to be a localization given by Lemma 16731 We construct an (u, 2)-array as 
follows. 

At stage 0, let c{], Cq be any pair of P 2 -incompatible elements each of which is a consistent 
1-point extension of A in P/°. At stage n + 1, write C n for {c\ : t < 2, i < n} and suppose 
we have defined P/ n G Locf"(A). By hypothesis, there are c° +1 ,c* +1 G P( n such that 
— 1P2 , c^ +1 ) and such that each c^ +1 is a consistent 1-point extension of A (Fact 15. 18|) . 
Let C n+1 = C n U {c° +1 ,4 +1 } and define P/" +1 G Locf' l+1 (A) by 

Pt+\y) = P({y) A P 2 (y; c° +1 ) A P 2 (y; c* +1 ) 

Thus we construct an (cu, 2)-array for P 2 , as desired. Applying Lemma 16.31 we obtain an 
(u>, 2)-array for P^. □ 

Conclusion 6.6. Suppose that tp is stable, (T, tp) \— > (P n ) and A is a positive base set. Then 
empty pairs are not persistent around A. 

Proof. By stability, we may work inside the localization given by Lemma [6731 Suppose empty 
pairs were persistent around A. By Lemma [6.51 P^ is (u;, 2), which by Remark 16.21 implies 
that tp has the independence property: contradiction. □ 

In order to replace the hypothesis of stable with low, we will need to replace P 2 -consistency 
in the proof of Lemma 16.51 with P^-consistency. This argument is given in full generality in 
Lemma 16. 161 but here we state the result: 



Corollary 6.7. (to Corollary 6.4\) Suppose p and tp 2 are both low. Suppose every localization 
P( around some fixed positive base set A contains an empty pair. Then P^ is (u;, 2). 

In fact, modulo the proof of Lemma [6.161 we have shown: 

Conclusion 6.8. Suppose that tp is NIP, (T, tp) i— ► (P„) and A is a positive base set. Then 
empty pairs are not persistent around A. 

Proof. By Observation 15.51 all NIP formulas are low. By Fact I6.15[ p NIP implies p2 is NIP 
and therefore low. By Corollary 16.71 and Remark 16.21 the persistence of empty pairs would 
imply p has the independence property, contradiction. □ 

6.2. NIP: the case of n. We now build a more general framework, working towards The- 
orem EIZl which generalizes Conclusions I6.61[6~81 to the case of arbitrary n < to: if T is NIP 
then no P ra -empty tuple can be persistent. The basic strategy is as follows. If a P„-empty 
tuple is persistent, Lemma 16.161 produces an (u, n)-array. In this higher- dimensional case, 
in order to extract the independence property from an (cu, n)-array via Observation 16 . 1 ll we 
need the array to have an additional property called sharpness. The "sharpness lemma," 
Lemma 16 . 14[ returns an array of the correct form at the cost of possibly adding finitely many 
parameters. Fact 16 .151 then pulls this down to the independence property for p. 

With some care, we are able to get quite strong control on the kind of localization used. 
When T is stable in addition to NIP, the argument can be done with a uniform finite bound 
(as a function of n) on the arity of the predicates P m used in localization. 

Definition 6.9. {{u, n)-arrays revisited) Assume n < r < u . Compare Definition ^ -ty here, 
the possible ambiguity of the amount of consistency will be important. 
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(1) The predicate P r is (uo, n) if there is C — {c\ : t < n,i < uj} C Pi such that, for all 

<$,...<£ ec, 

• r -tuples from r distinct columns are consistent, i.e. 

j,k<r 

• and no column is entirely consistent, i.e. for all a C r, \a\ = n, 

f\ i j= i k =► -.P r (<* . . . C%) 

j,kea 

Any such C is an (uo, n)-array. The precise arity of consistency is not specified, see 
condition (4). 

(2) If for all n <r < uo, P r . is (uo,n), say that Poo is (ui,n). 

(3) A path through the (uj,n) array C is a set X C C which contains no more than n-1 
elements from each column. 

(4) P r is sharply (cu,n) if it contains an {uj,n)-array C on which, moreover, for all 

{<$,...<£} cc 

p r {c%...c%) ^ /\ (/\i j = i k => y t j = t k ) 

aCr,\a\=n \j,k<=cr jj^kecr / 

i.e., if every path is a P r -complete graph. 

(5) Pqo is sharply (u,n) if P r is sharply (ou,n) for all n < r < uj. 

Remark 6.10. (1) Every (u,2)-array is automatically sharp. 

(2) Suppose Poo has an (u,n)- array; this does not necessarily imply that P m has an 
(u, to)- array for m < n, because m elements from a single column need not be incon- 
sistent, e.g. if the (u,n)-array is sharp. 

Observation 6.11. If Poo is sharply (cu,k) then (fk-i has the independence property. 

Proof. Let X = (a], . . . a\ : i < uj) be the array in question; then ifk-i has the independence 
property on any maximal path, e.g. B := (a}, . . . a^ -1 : i < uo). To see this, fix any er, r C uo 
finite disjoint; then by the sharpness hypothesis {a}, . . . a^ 1 : % E a} U {a|, . . . : j E t} 
is a Poo-complete graph and thus corresponds to a consistent partial y?-type q. But any 
realization a of q cannot satisfy <p(x; aj) for any j E r, because Pk does not hold on the 
columns. A fortiori -Kp k (a; a°, . . . Oj -1 )- D 

Let us write down some conventions for describing types in an array. 

Definition 6.12. Let x\,x S j be elements of some (uj, n)- array X . 

(1) Let [x\] = {x S j E X : j = i}, i.e. the elements in the same column as x\. 

(2) Let X = {x 1 ^, . . -x^} C X be a finite subset. The column count of . . .x 1 ^} is 
the unique tuple (toi, . . .me) E uo 1 such that: 

• TOj > TOj+i for each i < £ 

• Ej rrii — £ 
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if Y = {yi, . . . y r } is a maximal subset of X such that y, z £ Y ,y ^ z 
[z], then some permutation of 

Mnx [y r ]nx 

is equal to (mi, ■ ■ - mi). 
In other words, we count how many elements have been assigned to each column, and 
put these counts in descending order of size. Write col-ct(x) for this tuple. 
(3) Let < be the lexicographic order on column counts, i.e. (1, 1, . . . ) < (2, 1, . . . ). This is 
a discrete linear order, so we can define (mi, . . . m^) + to be the immediate successor 
of (mi, . . . mi) in this order. Define gap((mi, . . . mi)) = rrii where ((ni, . . . ni) + = 
(mi, ■ ■ ■ me) and Vj ^ i mj = rij, i.e. the value which has just incremented. 

By analogy to Lemma 16.31 and its corollary, 

Lemma 6.13. (Springboard lemma) Fix 2 < n < ui, and let (P n ) be the characteristic 
sequence of (T,ip). Suppose that the formulas if, if2, ■ ■ ■ f2n-2 are low. Then there exist 
1 < ko < u and a localization P( of Pi in which the following are equivalent: 

(1) P( contains a sharp (oj,n)- array for P^, where ji = (2n — 2)/c - 

(2) P[ contains a sharp (uj, n)- array for P m . 

Proof. Assume (1), so let C = {c* : t < n, i < uj} C P( be sharply (uj, n) for P M , chosen with- 
out loss of generality to be an indiscernible sequence of n-tuples. Fix a path Y — yi, . . . y m 
of minimal size m > n such that ^P m (yi, ■ ■ ■ y m )- Let S := {c°, . . . cf~ , c\ +l , . . . c™ +1 : i < 
uj} C C 2n ~ 2 be a sequence of pairs of offset (n — l)-tuples. 
Note that S is 1-consistent as we assumed (1). 

On the other hand, C is indiscernible, so any increasing sequence of m elements from S 
will cover all the possible m-types from C. Since Y is inconsistent, this implies that S is 
m-inconsistent. These m elements will be distributed over at least instances of y?2n-2; 
by inductive hypothesis, one fewer element, thus one fewer instance, would be consistent. 
Thus v ? 2n-2 is sharply m'-dividable for some m' > k^. 

The appropriate ko is thus a strict upper bound on the possible arity of dividing of each 
of the formulas {y?2£-2 : 1 < I < ko}, which exists by lowness. When T is low but possibly 
unstable, determining k is the important step; no localization is then necessary. When T is 
stable, however, w.l.o.g. k = In — 2 as by Corollary 15.61 we can simply choose a localization 
in which the 2n — 2 formulas are not fc-dividable for any k. □ 

We next give a lemma which will extract a sharp array from an array. Recall that P^ is 
the Tocalized sequence from Definition 15.11 i.e. the characteristic sequence of the formula 
<P(z;y) A /\ ae ^(x;a). 

Lemma 6.14. (Sharpness lemma) Let a c Pi be finite, n < uj. Suppose that P^ contains an 
(uj,n)- array. Then there exist a', i with a C a' c Pi and 2 < £ < n such that P^ contains 
a sharp (u,£) -array. 

Proof. Let us show that, given an (cu, n)-array for P^, either 

• there is a sharp (ui,n) array for P^, or else 

• by adding no more than finitely many parameters we can construct an (a;, £)-array 
for P^ and some £ < n. 
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Note that the second is nontrivial by Remark 16.101 As an (a;, 2)-array is automatically 
sharp, we can then iterate the argument to obtain the lemma. 

We have, then, some (u, n)-array C in hand. Without loss of generality C is an indis- 
cernible sequence of n-tuples. If every path through C is a P^-complete graph then C is 
a sharp (a>, n)-array and we are done. Otherwise, choose some finite Z C C whose column 
count is as small as possible subject to the conditions: 

(1) Z is a path 

(2) There exists some Y G C such that 

(i) y n \z\ = 

(ii) y h y 2 eY [yt] n [y 2 ] = 

but Z U Y is not a P^-complete graph. 

In other words, Z is a possible new parameter set which is just slightly too large: the subset 
of C which is consistent with Z fails to be an (u;,n)-array because some set of elements 
from distinct columns is not consistent relative to Z. Set X := Z U Y, where Y is the finite 
sequence from (2). 

The assumption that C is not sharp gives an unspecified finite bound on \Z\; in fact the 
springboard lemma gives a more informative bound k > 2n — 2. On the other hand, by 
definition of (u, n)-array, any such Z must contain at least two elements from the same 
column, so \Z\ > 1 and we can find our witness working upwards on column count. Because 
C is an indiscernible sequence of n-tuples, we may assume that the elements of Z are in 
columns which are infinitely far apart. Finally, if Z C Z, then col-ct (Z ) < col-ct (Z). So 
for any W C C satisfying conditions (2).(i)-(ii) just given, Z U W is a P^-complete graph. 

In particular, we can choose a partition X = Xq U X\ where 

(I) x n x 1 = 0, c x 1 c x 

(II) x, x' e x x => \x\ = [x'\ 

(III) n>£:=\X 1 \= gap(col-ct(Z)) > 1 

(IV) For any W C C satisfying conditions (2).(i)-(ii), X U W is a P^-complete graph. 

To finish, let a' = a U Xo and let C C C be an infinite sequence of ^-tuples which realize 
the same type as X\ over a U Xq. (For instance, restrict C to the rows containing elements 
of X\ and to infinitely many columns which do not contain elements of Xq.) Since Z was 
chosen to be a path, I < n (condition (III)) and \a'\ < \X\ < u. By condition (2), -P/ (c) 
for any column c of C. On the other hand, by condition (IV) any subset of C containing no 
more than one element from each column is a P^-complete graph. Thus C is an (u, £)-array 
for P^, as desired. If it is not sharp, repeat the argument. □ 

Fact 6.15. The following are equivalent for a formula <p(x; y). 

(1) if has the independence property. 

(2) For some n < u, tp n has the independence property. 

(3) For every n < uj, (p n has the independence property. 

(4) Some * localization (p a has the independence property. 

Proof. (1) — > (3) — ► (2) — > (1) — > (4) are straightforward: use the facts that the formulas 
<Pj generate the same space of types, and that the independence property can be characterized 
in terms of counting types over finite sets ( [8] :II.4) . Finally, (4) — ► (2) as we have simply 
specified some of the parameters. □ 
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Lemma 6.16. Suppose that for some n < u, every localization of Pi around some fixed 
positive base set A contains an n-tuple on which P n does not hold. Then P^ is (oo,n), 
though not necessarily sharply (u,n). 

Proof. Let us show that Pk is (u),n) for any k > n. This suffices as, by Convention 12. 3[ we 

may apply compactness. 

Fix k > n and let P[ be any localization, for instance that of Lemma 16. 131 

At stage 0, let c° , cj, . . . cj" 1 C P/° := P( be an n-tuple of elements on which P n does 

not hold, chosen by Fact 15.181 so that each d is a consistent 1-point extension [in the sense 

of P n ] of A. Let X = {{cq} : i < n} be the set of these singletons. Write x to denote the 

elements of x. Define 

P{ 1 (y) = Pl°(y)A f\ P 2 (y,x) 

x£X 

which includes A by construction. 

At stage m + 1, write C m for {c\ : t < n, i < m} and consider the localized set of elements 
P( m G Locf m (A). Let 

X m := {x C C m : \x\ — k — 1 and for all i < m, \x fl {C i+ i \ CA\ < 1} 

i.e. sets which choose no more than one element from each stage in the construction. 

By hypothesis, there are c^ +1 , . . . c™^ G P( m such that -P n (c^ +1 , . . . +\) and such that 

for all x G X m , each c l m+1 is a consistent 1-point extension of A U x, in the sense of P n . Let 

C m+ i = C m U {c^ +1 , . . . c™ +\}, and let X m+1 be the sets from C m+ i which choose no more 

f c 

than one element from each stage in the construction. We now define P( m+1 G LoCi m+1 (A) 
by 

P{ m+1 (y) = P{-(y)A /\ P k (y;x) 

xex m +i 

(If m < k, the parameters from x need not necessarily be distinct.) Again, this localization 
contains A by construction. Thus we construct an (u, n)-array for P^, as desired. As k was 
arbitrary, we finish. □ 

Recall that a P n -empty tuple is any To-configuration for which X = n, {1, . . . n} ^ E x , i.e. 
Z/i, ■ ■ ■ Vn ^ Pl suc h that -iP n (yi, . . . y„). We are now in a position to prove: 

Theorem 6.17. Suppose that (f is NIP, (T,ip) i— > (P n ) and A is a positive base set for (p. 
Then for each n < u, P n - empty tuples are not persistent around A. 

Proof. By lowness, we work inside the localization P[ D A given by the Springboard Lemma 
16.131 Suppose that for some n < u, P n -empty tuples are persistent. Apply Lemma 16.161 
to obtain an (u, n)-array for P^, which is not necessarily sharp. The Sharpness Lemma 
16.141 then gives a sharp (cu, £) -array for P^, where a C Pi is a finite set of parameters. 
The sequence (P£) is just the characteristic sequence of the "localized formula (p a , that is, 
(p(x; y)A(p m (x; a), where m — \a\. By Observation 16.1 ll this means <Pg_ 1 has the independence 
property. Now by (2) — ► (1) of Fact 16. 151 applied to (p a , ip a has the independence property. By 
(4) — > (1) of the same fact, ip must also have the independence property, contradiction. □ 

Corollary 6.18. Suppose that if is NIP, (T,tp) i— > (P n ) and A is a positive base set for (p. 
Then for each n < w, there is a localization P[ such that: 
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(1) A c P{ 

(2) { Vl , ...y n }c P( P n { yi , ...y n ) 

In other words, when ip is NIP, given a positive base set A there is, for each n, a definable 
restriction of Pi containing A on which P n is a complete graph. In the other direction, if (p 
has the independence property then P^ is (u>,2) by Claim H751 so in particular Pi is not a 
complete graph. In fact: 

Theorem 6.19. Let (p be a formula ofT and (P n : n < uj) its characteristic sequence. 
Then the following are equivalent for any positive base set A: 

(1) There exists a localization ip? of ip such that ^ is NIP and P( D A. 

(2) There exists a localization ip 9 of <p such that <p 9 is stable and Pf D A. 

(3) For every n < u, there exists a localization P( n D A which is a P n - complete graph. 

Proof. Note that ip* = ip and ip 9 = ip are possible. 

(2) ^(1) because stable implies NIP. 

(1)^(3) is just Theorem 16.171 no P n -empty tuple is persistent, so eventually one obtains 
a localization which is a complete graph. 

(3) — >(2) By Claim H3J if tp 9 has the order property its associated Pf contains a diagram 
in the sense of Definition 14.21 Thus it contains an empty pair, and so a fortiori a P„-empty 
tuple, for each n. □ 

Example 6.20. Consider (Q, <), let tp(x;y,z) = y > x > z and let the positive base 
set A be given by concentric intervals {(a^, foj) : i < k} C P\. Then there is indeed a 
Pi-empty pair (ci, C2), (c?i, d 2 ) which are each consistent 1 -point extensions of A - namely, 
any pair of disjoint intervals lying in the cut described by the type corresponding to A. 
Localizing to require consistency with any such pair amounts to giving a definable complete 
graph containing A, i.e. realizing the type. 

6.3. Simplicity. We have seen that the natural first question for persistence, whether there 
exist persistent empty tuples, characterizes stability: Theorem 16.191 Here we will show that 
a natural next question, whether there exist persistent infinite empty graphs, characterizes 
simplicity. Recall that a formula is simple if it does not have the tree property; see [5], [14J. 

Notice that we have an immediate proof of this fact by Observation 15.3} which appealed 
to finite D(<p, A;)-rank for simple formulas to conclude that infinite empty graphs are not 
persistent. Let us sketch the framework for a different proof by analogy with the previous 
section. This amounts to deriving Observation 15.31 directly in the characteristic sequence. 

Remark 6.21. In the case of stability, much of the work came in establishing sharpness of 
the (00 ,£)- array . Here, since the persistent configuration is infinite, we have compactness on 
our side; we may in fact always choose the persistent empty graphs to be indiscernible and 
uniformly k-consistent but (k + 1) -inconsistent, for some given k < uj. 

Observation 6.22. Suppose that (T,ip) 1— > (P n ). Then the following are equivalent: 

(1) there is a set T = {a v : rj G 2 <w } C Pi such that, writing C for initial segment: 

(a) For each v G 2 W ; {a v : rj C u} is a complete P^-graph. 

(b) For some k < uj, and for all p G u <UJ , the set {a p -j : i < uj} C Pi is a P k -empty 
graph. 

(2) ip has the k-tree property. 
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Proof. This is a direct translation of Definition 14. 11 



□ 



Lemma 6.23. Let Xk be the T$- configuration describing a strict (k + 1) -inconsistent se- 
quence, i.e. Vx k = oj and Ex k = {<? : cr c uj, \o~\ < k}. Suppose that for some fixed k < uj 
and some formula p, Xk is persistent in the characteristic sequence (P n ) of (p. Then p is 
not simple. 

Proof. Let us show that p has the tree property, around some positive base set A if one is 
specified. At stage 0, by hypothesis there exists an infinite indiscernible sharply (k + 1)- 
inconsistent sequence Y C Pi, each of whose elements can be chosen to be a consistent 
1-point extension of A in the sense of P^ by Fact 15.181 Set dj to be the ith element of this 
sequence, for % < uj. 

At stage t + 1, suppose we have constructed a tree of height n, T n = {a v : r\ G uj-"} such 
that, writing C for initial segment: 

• every path is a consistent n-point extension of A, i.e. A U {a v : 77 C u} is a complete 
Poo-graph, for each v G uj n \ 

• for all < k < n and all r\ e u k , {a^-j : i < uj} is P^-complete but Pt+i-empty 

We would like to extend the tree to level n + 1, and it suffices to show that the extension 
of any given node a u (for v G uj u ) can be accomplished. But this amounts to repeating the 
argument for stage in the case where A = AU {a v : r\ C v\. By assumption and Fact 15 .18} 
this remains possible, so we continue. 

Notice that the threat of all possible localizations is what makes continuation possible. 
That is, the schema which says that u x is a 1-point extension of A" simply says that x 
remains (along with witnesses for Xk) in each of an infinite set of localizations of Pi with 
parameters from A. If this schema is inconsistent, there will be a localization contradicting 
the hypothesis. □ 

We can now characterize simplicity in terms of persistence: 

Theorem 6.24. Let ip be a formula ofT and (P„) its characteristic sequence. 

(1) If the localization tpf of <p is simple, then for each P^-graph A C P( and for each 
n < uj, there exists a localization P[ n D A of P( in which there is a uniform finite 
bound on the size of a P n -empty graph, i.e. there exists m n such that X C P[ and 
X n n P n = implies \X\ < m n . 

(2) // localization p 9 of p is not simple, then for all but finitely many r < uj, Pf contains 
an infinite (r + 1)- empty graph. 

In other words, the following are equivalent for any positive base set A: 

(i) There exists a localization p$ of p (with pf = p possible) such that pf is simple and 
P( D A. 

(ii) For each n < uj, there exists a localization P( n D A in which there is a uniform finite 
bound on the size of a P n - empty graph. 

Proof. It suffices to show the first two statements. (1) is Lemma [6.231 applied to the formula 
pf . (2) is the second clause of Observation I6.22[ where "almost all" means for r above k, 
the arity of dividing. □ 
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